Incorporating Contextual Phonetics into Automatic Speech Recognition

نویسندگان

  • Eric Fosler-Lussier
  • Steven Greenberg
  • Nelson Morgan
چکیده

This work outlines the problems encountered in modeling pronunciation for automatic speech recognition (ASR) of spontaneous (American) English speech. We detail some of the phonetic phenomena within the Switchboard corpus that make the recognition of this speaking style difficult. Phonetic transcribers found that feature spreading and cue trading made identification of phonetic segmental boundaries problematic. Including different forms of context in pronunciation models, however, may alleviate these problems in the ASR domain. The syllable appears to play an important role, as many of the phonetic phenomena seen are syllable-internal, and the increase in pronunciation variation compared to read speech is concentrated in coda consonants. In addition, we show that other forms of context – speaking rate and word predictability – help indicate increases in variability. We present a dynamic ASR pronunciation model that utilizes longer phonetic contextual windows for capturing the range of detail characteristic of naturally spoken language.

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Designing and implementing a system for Automatic recognition of Persian letters by Lip-reading using image processing methods

For many years, speech has been the most natural and efficient means of information exchange for human beings. With the advancement of technology and the prevalence of computer usage, the design and production of speech recognition systems have been considered by researchers. Among this, lip-reading techniques encountered with many challenges for speech recognition, that one of the challenges b...

متن کامل

Phonetics: The Key to High Quality Speech Recognition and Synthesis

Linguistic knowledge has been maligned by speech recognition researchers in the past as being harmful to good performance. However an examination of the advances in speech recognition and speech synthesis shows that many of them were made by incorporating linguistic knowledge into the systems. The most useful type of knowledge is of a quantitative, rather that qualitative nature. This is precis...

متن کامل

What Is Automatic Speech Recognition and Who Uses It?1

Paul De Palma Department of Computer Science Gonzaga University Spokane, WA [email protected] Research into automatic speech recognition (ASR) has a long history dating to the earliest days of computing. It began roughly at the same time that researchers first developed compilers for the early high-level programming languages. Bell Labs, RCA Research, and MIT’s Lincoln labs all used new ideas...

متن کامل

Speech recognition, sylabification and statistical phonetics

The classical approach in phonetics of careful observation of individual utterances can, this paper contends, be usefully augmented with automatic statistical analyses of large amounts of speech. Such analyses, using methods derived from speech recognition, are shown to quantify several known phonetic phenomena, most of which require syllable structure to be taken into account, and reveal some ...

متن کامل

Erik R . Thomas 7 Instrumental Phonetics ERIK

s of the Tenth International Congress of Phonetic Sciences.Dordrecht: Foris Publications.253–58.Ohala, John J. (1985). Linguistics andautomatic processing of speech.In R. De Mori and C. Y. Suen (eds.), New Systems and Architectures for Automatic Speech Recognition and Synthesis. Berlin: Springer-Verlag.447–75.

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 1999